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(57) ABSTRACT 

According to one embodiment, a method of transferring data 
across a semiconductor chip comprises transmitting data 
from a first Rambus asic cell to a second Rambus asic cell 
using clock pulses generated at a first clock generator and 
sampling the data at the second Rambus asic cell using clock 
pulses generated at a second clock generator. 
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CROSS CHIP TRANSFER MECHANISM FOR panying drawings of various embodiments of the invention. 

A MEMORY REPEATER CHIP IN A DRAM The drawings, however, should not be taken to limit the 

MEMORY SYSTEM invention to the specific embodiments, but explanation and 

understanding only. 
5 FIG. 1 is a block diagram of one embodiment of a 
computer system; 

The present invention relates to memory systems; more FIG. 2 is a block diagram of one embodiment of a 

particularly, the present invention relates to transferring data memory controller coupled to a main memory device via a 

across a memory repeater chip in a Rambus memory sub- repeater; 

system, pjQ 3 is a block diagram of one embodiment of a 

BACKGROUND repeater; 

_ ^ FIG. 4 is a block diagram of one embodiment of a transfer 
A Rambus Dynamic RAM (RDRAM) developed by 

Rambus, Inc., of Mountain View, Calif., is a type of memory . * ^ . ,. /• 

that permits data transfer operations at speeds yp to 1.2-1.6 »^ ^ ^^g''^ °^ "^Mary repeater. 

gigabytes per second. RDRAM devices are typically housed DETAILED DESCRIPTION 

in Rambus in-hne memory modules (RIMMs) that are 

coupled to one or more Rambus channels. Typically, the A mechanism for providing cross chip transfers at high 

expansion channels couple each RDRAM device to a speeds of communication is described. In the following 

memory controller. The memory controller enables other ^ detailed description of the present invention numerous spc- 

devices, such as a Central Processing Unit (CPU), to access cific details are set forth in order to provide a thorough 

the RDRAMs. understanding of the present invention. However, it will be 

RDRAM based memory subsystems may include repeat- apparent to one skilled in the art that the present invention 

eis coupled to the expansion channel that monitor the practiced without these specific details. In other 

expansion channel for activity and repeat the activity on one instances, well-known structures and devices are shown in 

or more of the stick channels coupled thereto. FIG. 5 is a Week diagram form, rather than in detaQ, in order to avoid 

block diagram of an exemplary repeater. The repeater obscuring the present invention. 

includes a slave.Rambus Asic CeU-(RAG)-and-two-master— ^ FIG. 1 is a block diagram of oae embodiment of a 

RACs. The slaveRAC is coupled to'the expansionchannel, computer system 100. Computer system 100 includes a 

while the master RACs are each coupled to a stick channel. central processing imit (processor) 105 coupled to processor 

The RACs are used to interface with the high frequency bus HO. In one embodiment, processor 105 is a processor in 

expansion or stick channels. Typically, a plurality of the Pentium® family of processors including the PentiumfS) 

RDRAM devices are coupled to each of the slick channels. II family and mobile Pentium® and Pentium® II processors 

The slave RAC operates various portions of its logic on 35 available from Intel Corporation of Santa Clara, Calif, 

one of three clocks (e.g., a slave receive clock and two slave Alternatively, other processors may be used. Processor 105 

transmit clocks). The master RACs also operate their logic °iay include a first level (LI) cache memory (not shown in 
using three different clocks (e.g., a master receive clock and 

two master transmit clocks). In the described mechanism, In one embodiment, processor 105 is also coupled to 

the receive clocks in the slave RAC and master RACs are 40 cache memory 107, which is a second level (L2) cache 

synchronized via a phase locked-loop (PLL) in order to memory, via dedicated cache bus 102. The LI and L2 cache 

transfer data within the repeater at speeds up to 400 Mhz. memories can also be integrated into a single device. 

Whenever commands and data is to be vmtten to an Alternatively, cache memory 107 may be coupled to pro- 

RDRAM on one of the stick channels, the data must be oessor 105 by a shared bus. Cache memory 107 is optional 

transmitted from the expansion channel through the slave 45 and is not required for computer system 100. 

RAC and across the repeater chip to a master RAC. The Chip set 120 is also coupled to processor bus HO. In one 

slave RAC receives commands and data with the slave embodiment, chip set 120 is the 440BX chip set available 

receive clock and uses the slave receive clock to transmit the from Intel Corporation; however, other chip sets can also be 

commands and data across the chip to the particular master used. Chip set 120 may include a memory controller for 

RAC, 5Q controlling a main memory 113. Further, cbdp set 120 may 

A problem exists, however, in sampling the command and be coupled to a video device 125 that handles video data 

data signals at a master RAC at such a high speed. Consid- requests to access main memory 113. In one embodiment, 

cring clock jitters and phase errors, sampling the signals video device 125 includes a video monitor such as a cathode 

reliably in a master RAC with the master receive clock is ray tube (CRT) or liquid crystal display (LCD) and neces- 

often difficult because the lack of sufficient hold times. For ss sary support circuitry. 

instance, hold time violations would imply that delay must Main memory 113 is coupled to processor bus 110 

be added to the signal path. However, adding the requisite through chip set 120. Main memory 113 and cache memory 

time delay to fix the hold time problem could potentially 107 store sequences of instructions that are executed by 

cause a setup time violation. Taking into consideration the processor 105. In one embodiment, main memory 113 

problems stated above and the long cross chip distance for eo includes a Rambus dynamic random access memory 

data transfer, it would be desirable to provide a mechanism (RDRAM) system; however, main memory 113 may have 

for providing cross chip transfers at high speeds of commu- other configurations. The sequences of instructions executed 

nication. by processor 105 may be retrieved from main memory 113, 

RRIFF OFSPRTPTrnM OF THF HR AWTKr,<5 memory 107, or any other storage device. Additional 

BRIEF DESCRIPTION OF THE DRAWINGS ^^^-^^^ ^^^^^^^ processor bus 110, such as 

The present invention will be understood more fully from multiple processors andt/or multiple main memory devices, 

the detailed description given below and from the accom- Computer system 100 is described in terms of a single 
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processor; however, multiple processors can be coupled to 250. STCLK is used to drive sigoals received from master 

processor bus 110. RACs 255 out to the expansion channel, while MTCLK is 

Processor bus UO is coupled to system bus 130 by chip to drive signals received &om slave RAC 252 out to the 

set 120. la one embodiment, system bus 130 is a Peripheral attached stick channel. 

Component Interconnect (PCI) bus adhering to a Specifica- 5 STCLK90 generator 310 and MTCLK90 generator 360 

tioa Revision 2.1 bus developed by the PCI Special Interest generate STCLK90 and MTCLK90 clock pulses, respec- 

Group of Portland, Oreg.; however, other bus standards may lively. STCLK90 and MTCLK90 are used to generate trans- 

also be used. Multiple devices, such as audio device 127, mit clocks for transmission of data. According to one 

may be coupled to system bus 130. embodiment, STCLK90 leads SRCLK and MTCLK90 leads 

Bus bridge 140 couples system bus 130 to secondary bus MRCLK by ninety degrees (90**) (e.g., Va clock cycle). In 

150. In one embodiment, secondary bus 150 is an Industry addition, STCLK leads STCLK90 and MTCLK leads 

Standard Architecture (ISA) Specification Revision 1.0a bus MTCLK90 by an output buffer delay. SRCLK generator 315 

developed by International Business Machines of Armonk, and MRCLK generator 365 generate SRCLK and MRCLK 

N.Y. However, other bus standards may also be used, for clock pulses, respectively. SRCLK and MRCLK are used to 

example Extended Industry Standard Architecture (EISA) sample command and data signals received at repeater 250 

Specification Revision 3.12 developed by Compaq from the expansion channel and stick channel, re^ectively. 

Computer, et al. Multiple devices, such as hard disk 153 and For example, SRCLK is used to sample signals at slave RAC 

disk drive 154 may be coupled to secondary bus 150. Other 252 from the expansion channel. 

d^vices^such as cursor control devices (not shown in FIG. Further, SRCLK is used to transmit the signals from RAC 
A), may fee coupled to secondary bus 150. ^ 252 to master RAC 255 across repeater 250. According to a 

( FIG. 2 is- a block diagram of one embodiment of a further embodiment the clock generators described above 

Vemory/controller 220 coupled to main memory 113, are implemented using a delay-locked loop (DLL). 

Memo^ controller 220 includes a Rambus Asic Cell (RAC) However, one of ordinary skill in the art will appreciate that 

/225r^R AC 225 is used to interface to a high frequency one or more of the clocks may be,gencrated using other 

/ expans ionchannel. The expansion channel maybe driven at devices, such as phased-lockcd loops (PLL). 

^ 4'(50'Mhz, and transfer data on the rising and falling edge of Synchronization circuit 320 synchronizes SRCLK with 

( an expansion channel clock. According to one embodiment, MRCLK. According to one embodiment, synchronization 

) R AC 225 is implemented as a Rambus Asic cell (RR AC). circuit 320 is implemented using a phased-locked loop 
^ Main memory 113 includes repeater 250 coupled to 30 (PLL). However, one of ordinary skill in the art will appre- 

memory controller 220 via the expansion channel. Repeater ciate that other devices, such as a delaylocked loop (DLL) 

2504s_coupled to two stick channels (Stick 1 and Stick 2). may used to implement synchronization circuit 320. Flip- 

<^R'5eater^^monitorsIthe_expabs^ activity Aop 325 samples data received at slave RAC 252 from the 

^ ffamrtn^Sory controller 220 and repeats the activity on one expansion channel. According to one embodiment, flip-flop 
or more of the stick channels. The stick channe ls coupled to 35 325 is a D-flip-flop. However, in other embodiments, other 

repeater 250 function as an extension of the expansion types of flip-flops may be used. 

channel. According to one embodiment, each stick channel Transfer unit 370 is coupled to slave RAC 252 and 
includes 32 DRAM (or memory) devices that are included receives MTCLK and MTCLK90, Transfer unit 370 is used 
within one or more memory modules. Alternatively, each to receive signals from slave RAC 252 for transmission out 
stick channel may include other quantities of DRAM 4Q to the attached stick channel. According to one embodiment, 
devices. Ordinarily, a maximum of 32 memory devices may whenever command and data signals are to be written to a 
be directiy coupled to the expansion channel Therefore, the memory device on the stick channel, the signals are trans- 
stick channels coupled to repeater 250 permit up to 128 mitted from the expansion channel through slave RAC 252 
memory devices to be accessed by memory controller 220. and across the repeater 250 chip to master RAC 255. The 
In addition, fepater~250:may incJMe;^^ transfer of data between, slave RAC 252 and master RAC 
255~and-onc- jiave _RAC;252.^As described above >?^th 255 is carried out using the same operating frequency as the 
respect to R^^^,~EtAC 252"inte^ce^ expansion channel. As described above, slave RAC 252 
repeater 250 with^^ exp^s^TchiBHelT^ 255 receives the signals and transmits the signals across repealer 
' aro used to interface the logic within rq)eatcr 250 to the stick to a master RAC 255 using SRCLK. The signals are 
channels. RACs 2 55 transmit and recei ve clock signals tiiat 50 subsequentiy received at transfer unit 370 using MTCLK90 
have a fixcd~7elationship between a receive clock and a and transmitted to the stick channel using MTCLK 
transmit clock. Meanwhile, for RAC 252 there is no rcla- FIG. 4 is a block diagram of one embodiment of a transfer 
tionship the received clock signals and transmit clock sig- unit 370. Transfer unit 370 includes latches 410 and 430, a 
nals. ^ match circuit 420 and a select circuit 440, Latch 410 
nG.3isablockdiagramof one embodiment of a repeater 55 receives command and data signals from slave RAC 252. 
250 with a cross chip communication mechanism. As dis- According to one embodiment, the signals received from 
cussed above, repeater 250 includes slave RAC coupled to slave RAC 252 at latch 410 are sampled using MTCLK90. 
a master RAC 255. Slave RAC 252 includes two slave As described above, MTCLK90 leads and MRCLK by 
transmit clock generators (STCLK 305 and STCLK90 310), one-fourth (^4) of a clock cycle. Thus a suflBcient hold time 
a slave receive clock generator (SRCLK) 315, a synchroni- 60 (of approximately Va of a clock) is provided in order to 
zation circuit 320 and a latch 325. Master RAC 255 includes sample the signals received from slave RAC 252. According 
two master transmit ■ clock generators (MTCLK 355 and to one embodiment, data transmitted between slave RAC 
MTCLK90 360), a master receive clock generator 252 and transfer unit 370 takes approximately three-fourths 
(MRCLIQ 365 and a transfer xmit 370. STCLK generator (^/*) of a clock cycle to propagate across repeater 250. 
305 and MTCLK generator 355 generate STCLK and 65 Therefore, sufficient setup time is also provided. 
MTCLK clock pulses, respectively, STCLK and MTCLK Match circuit 420 is coupled to and receives data from 
are used to transmit command and data signals from repeater flip-flop 410. Match circuit 420 is used to determine which 
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RAC 255 is to transmit (or repeat) the packet. According to clock signals generated by the third clock source is one- 
one embodiment, since there are two stick channels per fourth of a cycle ahead of clock signals generated by the 
repealer, one of the two match circuits 420 will indicate a second cbck source. 

match. According to one embodiment, match circuit 420 is 6. The computer system of claim 4 wherein the second 

a decoder. Latch 430 is coupled to match circuit 420 and 5 Rambus asic cell comprises: 

receives the signals to be transmitted from master RAC 255. the second clock source; 

According to one embodiment, the signals received at latch t^e third clock sotiroe- and 

430 are sampled using MTCLK in order to re -time the data ^ transfer unit 

to be transmitted to the attached stick channel. ^ ^^^^^^ ^^^^^^ g ^^^^^-^ ^^^^^^ 

According to one embodiment, latches 410 and 430 are lo comprises; 

D-latches. However, one of ordinary skill in the art will ^ flip-flop- and 

appreciate that other types of latches may be used. Select a first latch coupled between the flip-flop and the fi^tsUck 

Circuit 440 IS coupled to latch 430 and receives command channel ^ 

and data signals that are to be transmitted to the stick g *Vhe™computer system of claim 7 wherein the fi^t 

charmel. Select circuil 440 selects between the command ti t_ • n • 

J , . - . jj • I r I. J- * Rambus asic cell comprises: 
and data signals and an odd path signal. Ln one embodiment, 

data is transferred on rising and falling edges of a channel ^ ^^^^^ ^^^^^^ 

clock. In such an embodiment, the falling edges are referred a second latch coupled between the expansion channel 

to as the even clock and the rising edges are referred to as and the flip-flop. 

the odd clock. However, one of ordinary skill in the art will 20 9. The computer system of claim 1 wherein the memory 

appreciate that such references may be reversed. device is a rambus dynamic random access memory. 

According to one embodiment, the command and data ^ °^*^«^y 'y^^J^ compr^ing a repeater, wherein the 

signals are selected on the falling edge the MTCLK, while P<^^/_" ^ °f received at a first 

the odd path signals are selected on the rising edge, Rambus asic ceflusmg clock pulses generated at a first clock 

Nevertheless, it will be appreciated that select circuit 440 ^ generator and receives the first set of signals at a second 

may operate according to different select signals. Further. ^^^^ ^^^^ ^^^^ ^^^^^ P^^*^'* generated at a second 

select circuit 440 may be implemented using a multiplexer. clodt generator. ^ i • -a t. • ... ^ . . 

Although transfer unit 370 has been described as including ^^ 10 wherem the first set 

matchcircuit420andselectcircuit440,oncof ordinary skiU ^'^^ is received at the first Rambus asic cell from an 

in the art will appreciate that these devices may be excluded expansion channe coupled to the first Rambus asic cell 

from transfer unit 370 without altering the scope of the ^^^f ^^^^^ P^^^ generated at the first clock generator. 

invcntioiL memory system of claim 10 further comprising a 

I. J j-rf: *• # *u first stick channel, wherein the first set of signals is trans- 

wneieas many a.erauons ana moomcauons oi tne ^^^^ ^^^^^ 

asic cell to the first stick 

presentmvenbonwiU no doubt become apparent to a person 3^ ^j^^^^j ^ ^ ^^^^ ^j^^ 

of ordmary skiU in the art after having read the foregoing gj-^tor & & & 

description, it is to be tmderstood that any particular embodi- lan, * fi-iiu*ii,i« 

, *V J . -u J u -n * *• • • 13. The memory system of claim 12 wherem clock pulses 

ment shown and described by way of illustration is in no * j u ^ i i * • u i? * 

. , . J ^ , -J J 1- %• Tn, f f generated by the second clock generator is one-half of a 

way intended to be considered Imiitmg. TTierefore, refer- ^^j^ ^^J^^ l^^^^^ 

ences to detaik of various embodmenu are not mtended to ^ ^^^^^^ ^j^^^ ^ ^^^^^^^^ 

hmit the scope of the claims which m themselves recite only * e JL r ^ u t ^ i i 

r . J J .1- • generator is one-fourth of a cycle ahead of clock pulses 

those features regarded as the mvention. ^^^^^^^^^ ^^^^^ ^^^^^ '^^^^^^^^^ 

Hius, a mechanism for providmg cross chip transfers at ^4 computer system of claim 12 wherein the second 

high speeds of communication has been described. Rambus asic ceU comprises: 

What is claimed is: a< 4U j 1 1 * 

^ . , . . . the second clock generator; 

1. A computer system comprising: 

, „ J the third clock generator: and 

a memory controller, and - . 

, . 1 J . .1. ^11 a transfer unit, 

a memory device coupled to the memory controller, le^^ . ^ ri-ivii_-*L* c 

. y J • • 1 J * *i. ! IS. The computer system of claim 14 wherein the transfer 

wherem the memory device includes a repeater that rises- 

transmits a first set of signals received at a first Rambus so ^ oompnses, 

asic cell using a first clock source and receives the first * mp-ilop, and 

set of signals at a second Rambus asic cell using a a latch coupled between the flip-flop and the first stick 

second clock source. channel. 

2. The computer system of claim 1 wherein the first set of 16- The computer system of claim 10 wherein the memory 
signals is received at the first Rambus asic cell from the 55 system is a Rambus dynamic random access memory, 
memory controller via an expansion channel coupled 17. A rambus repeater comprising: 

between the first Rambus asic cell and the memory control- a first Rambus asic cell; and 

ler. a second Rambus asic cell coupled to the first Rambus 

3. The computer system of daim 2 wherein the first set of asic ceU wherein the first Rambus asic cell uses clock 
signals is received at the first Rambus asic cell using the first 60 pulses generated at a first dock generator to receive a 
clock source. first set of signals and the second Rambus asic cell uses 

4. The computer system of daim 1 wherein the first set of clock pulses generated at a second clock generator to 
signals is transmitted from the second Rambus asic cell to a receive the first set of signals. 

first stick chaimel using a third cbck source. 18. The repeater of daim 17 further, wherein the first set 

5. The computer system of claim 4 wherein clock signals 65 of signals is transmitted from the second Rambus asic ceU to 
generated by the second dock source is one-half of a cyde a first stick channel using clock pulses generated at a third 
ahead of clock signals generated by the first clock source and clodc generator. 
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19. The repeater of claim 18 wherein clock pulses gen- 
erated by the second clock generator is one-fourth of a cycle 
ahead of dock pulses generated by the first clock generator 
and clock pulses generated by the third clock generator is 
one-fourth of a cycle ahead of clock pulses generated by the 
second clock generator. 

20. The computer system of claim 18 wherein the second 
Rambus asic cell comprises: 

the second clock generator; 
the third clock generator; and 
a transfer \mit, 

21. The computer system of claim 20 wherein the transfer 
unit comprise^ 

a flip-flop; and 

a latch coupled between the flip-flop and the first stick 
channel. 



8 



22. A method of transferring data across a memory system 
comprising: 

transmitting data firom a first Rambus asic cell to a second 
Rambus asic cell using clock pulses generated at a first 
5 clock generator; and 

sampling the data at the secoad Rambus asic cell using 
clock pulses generated at a second clock generator. 

23. The method of claim 22 further comprising sampling 
the data at the first Rambus asic cell from an expansion 

10 channel using clock pulses generated at the first clock 
generator. 

24. The method of claim 22 further comprising transmit- 
ting the data from the second Rambus asic cell to a stidc 
channel using clock pulses generated at a third clock gen- 

15 erator. 
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